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Abstract 

We discuss a model accounting for the creation and development 
of transport networks based on the Cameo principle which refers to 
the idea of distribution of resources, including land, water, minerals, 
fuel and wealth. We also give an outlook of the use of random walks 
as an effective tool for the investigation of network structures and 
its functional segmentation. In particular, we have studied the 
complex transport network of Venetian canals by means of random 
walks. 



Presentation for the volume: The challenge of complex network 
modelling calls for the more realistic heuristic principles that could catch 
the main features of network creation and development. In the Cameo 
model which refers to the idea of distribution of resources, including 
land, water, minerals, fuel and wealth, the local attractiveness of a site 
determining the creation of new spaces of motion in that is specified by a 
real positive parameter a; > 0. We have described a possible mechanism 
for the emergence and development of complex transport networks based 
on the Cameo principle. Sustained movement patterns are generated by 
a subset of automorphisms of the graph spanning the transport network 
of can be naturally interpreted as random walks. Random walks assign 
absolute scores to all nodes of a graph and embed space syntax into 
Euclidean space. Namely, every route of a transport network can be 
represented by a vector in Euclidean space which length quantifies the 
segregation of the route from the rest of the graph. We have empirically 
observed that the distribution of lengths over the edge connectivity in 
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the spatial network of Venetian canals exhibits scaling invariance 
phenomenon. The method is applicable to any transport network. 



1 Introduction 

Physics (Greek: Lpvai^, [phusis], nature) is the branch of science con- 
cerned with the characterization of universal laws of Nature portraying 
its logically ordered picture in agreement with experience. Theoretical 
physics is closely related to mathematics, which provides a language for 
physical theories and allows for a rationalization of thought by making 
it possible to formulate these laws in terms of mathematical relations. 
Physicists study a wide variety of phenomena creating new interdisci- 
plinary research fields by applying theories and methods originally devel- 
oped in physics in order to solve problems in economics, social science, 
biology, medicine, technology, etc. In their turn, these different branches 
of science inspire the invention of new concepts in physics. A basic tool 
of analysis, in such a context, is the mathematical theory of complexity 
concerned with the study of complex systems including human economies, 
climate, nervous systems, cells and living things, including human beings, 
as well as modern energy or communication infrastructures which are all 
networks of some kind. 

Complex systems appear as a result of the interplay between Topology 
determined by a connected graph. Dynamics described by the operators 
invariant with respect to graph symmetry, and properties of embedding 
(Euclidean) space specified by a set of measures and weights assigned to 
elements of the graph. In the context of complex networks theory created 
by physicists, the non-trivial topological structure of large networks is 
investigated by means of various statistical distributions. The structure 
and the properties of complex networks essentially depend on the way 
how nodes get connected to each other. Random graphs with a scale 
free distribution for the degree seem to appear very frequently in a great 
variety of real life situations like the World- Wide Web, the Internet, 
social networks, linguistic networks, citation networks and biochemical 
networks. 

In most of complex networks emerging in society and technology, each 
node has a feature which attracts the others. In a class of simple mod- 
els proposed in [1], the network dynamics can be described in terms of 
property of the node and the affinity other nodes have towards that prop- 
erty (Cameo graphs). Networks built accordingly to this principle have a 
degree distribution with a power law tail, whose exponent is determined 
only by the nodes with the largest affinity values. It appears that the ex- 
tremists lead the formation process of the network and manage to shape 
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the final topology of the system. 

The exceptional events play a crucial role in the formation of network 
structures [5]. The dynamics of some vertices the "hubs" which have 
an extremely high number of connections to other vertices is of primary 
importance for complex networks. These networks are generally "scale- 
free" ; in other words, they exhibit architectural and statistical stability as 
the degree distribution grows. A class of probabilistic model for a system 
at a threshold of instability has been studied in [3| . The distribution of 
residence times below the threshold characterizes the properties of such a 
system. Being at a threshold of instability, the system can induce various 
types of random graphs and the scale free random graphs among others 
[4]. The priority-based scheduling rules in single-stage queuing systems 
(QS) also generate fat tail behavior for the task waiting time distributions 
induced by the waiting times of very low priority tasks that stay unserved 
almost forever as the task priority indices are "frozen" in time [5]. The 
task waiting time distributions have been studied for a population-type 
model with an age structure and a QS with deadlines assigned to the 
incoming tasks, which is operated under the "earliest-deadline-first" pol- 
icy. As the aging mechanism ultimately assigns high priority to any long 
waiting tasks, fat tails cannot find their origin in the scheduling rule alone. 

Graphs obtained by successive creation and elimination of edges into 
small neighborhoods of the vertices evolve towards small world graphs 
with logarithmic diameter, high clustering coefficients and a fat tail dis- 
tribution for the degree [6] . It is important to note that it was only local 
edge formation processes that rise small worlds, no preferential attach- 
ment was used. Simple edge generation rules based on an inverse like 
mass action principle for random graphs over a structured vertex set, un- 
der very weak assumptions on the structure generating distribution, also 
yield a scale free distribution for the degree |7|. A local search principle 
important in many social applications, "my friends are your friends" have 
also been introduced and studied; networks generated in accordance to 
such a principle have essentially high clustering coefficients. 

Although investigations into the statistical properties of graphs such 
as a heavy-tail in the degree distribution of nodes could uncover their 
hierarchical structure, they are futile if the detailed information on the 
structure of graphs is of primary interest since many graphs character- 
ized by similar statistics of node degrees and shortest path lengths can 
be of dramatically different structures. The structure and symmetry of 
graphs play the crucial role in behavior of dynamical systems defined on 
that. It was clearly demonstrated in epidemiological research describing 
the dynamics of sexually transmitted diseases, the Human Immune Defi- 
ciency Virus (HIV) and AIDS, in particular [5]. Mathematical modelling 
on the spread of sexually transmitted diseases [5]- [13] studied on various 
random graphs displays the importance of critical parameters such as the 
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transmission probability and edge creation probability for the epidemic 
spreading. 

It has been found that the epidemic spreading in scale-free networks is 
very sensitive to the statistics of degree distribution, the effective spread- 
ing rate, the social strategy used by individuals to choose a partner, and 
the policy of administrating a cure to an infected node [2]. Depending 
on the interplay of these four factors, the stationary fractions of infected 
population as well as the epidemic threshold properties can be essentially 
different. For a model of scale- free graphs with biased partner choice 
that knowing the exponent for the degree distribution is in general not 
sufficient to decide epidemic threshold properties for exponents less than 
three [15] . Absence of epidemic threshold happens precisely when a posi- 
tive fraction of the nodes form a cluster of bounded diameter. Probably, 
it is impossible to obtain a simple immunization program that can be si- 
multaneously effective for all types of scale- free networks 14J. A similar 
approach can be applied in order to study social diseases like corruption. 
It has been investigated in (16j as a generalized epidemic process on the 
graph of social relationships. Corruption is characterized by a strong non- 
linear dependence of the transmission probability from the local density 
of corruption and the mean field influence of the overall corruption in 
the society. Network clustering and the degree-degree correlation play an 
essential role in these types of dynamics. In particular, it follows that 
strongly hierarchically organized societies are more vulnerable to corrup- 
tion than democracies. A similar type of modelling can be applied to 
other social contagion spreading processes like opinion formation, doping 
usage, social disorders or innovation dynamics. An agent-based model 
of factual communication in social systems, drawing on concepts from 
Luhmann's theory of social systems [17] has been studied in [18]. The 
agent communications are defined by the exchange of distinct messages. 
Message selection is based on the history of the communication and de- 
veloped within the confines of the problem of double contingency. We 
have examined the notion of learning in the light of the message-exchange 
description. 

Topology plays the primary role in the dynamical processes which have 
place on networks. The investigations in transitions to spatio-temporal 
intermittency in random network of coupled Chate - Manneville maps 
|19j show that spatiotemporal intermittency occurs for some intervals or 
windows of the values of the network connectivity, coupling strength, and 
the local parameter of the map. Within the intermittency windows, the 
system exhibits periodic and other nontrivial collective behaviors. Ge- 
netic regulatory networks constitute an important example of dynamical 
systems defined on graphs. Local dynamics of network nodes exhibits mul- 
tiple stationary states and oscillations depending crucially upon the global 
topology of a 'maximal' graph (comprising of all possible interactions be- 
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tween genes in the network) [5D] . The long time behavior observed in the 
network defined on the homogeneous 'maximal' graphs is featured by the 
fraction of positive interactions (activations) allowed between genes. In 
networks defined on the inhomogeneous directed graphs depleted in cy- 
cles, no oscillations arise in the system even if the negative interactions 
(inhibitions) in between genes present therein in abundance. Local dy- 
namics observed in the inhomogeneous scalable regulatory networks is less 
sensitive to the choice of initial conditions. 

In mathematics, the automorphism groups of a graph are studied. 
They characterize its symmetries, and are therefore very useful in de- 
termining certain of its properties. In particular, the Euclidean metric 
related to dynamics can be defined on some graphs by means of linear 
operators remaining invariant under the permutations of nodes and sat- 
isfying some conservation properties. These operators describe certain 
dynamical processes defined on graphs such as random walks and diffu- 
sions. We have studied transport through generalized trees in [2T] . Trees 
contain the simple nodes and super-nodes, either well-structured regular 
subgraphs or those with many triangles. We observe super-diffusion for 
the highly connected nodes while it is Brownian for the rest of the nodes. 
Transport within a super-node is affected by the finite size effects van- 
ishing as iV ^ oo. For a space of even dimensions, d — 2,4,6..., the 
finite size effects break down the perturbation theory at small scales and 
can be regularized by using a heat-kernel expansion. Diffusion processes 
and Laplace operators related to them can be used in order to investi- 
gate the structure of networks in the spirit of spectral graph theory. In 
[22], different models of random walks on the dual graphs of compact 
urban structures are considered. Dual graphs have been widely used in 
the framework of space syntax theories [23j for the analysis of spatial 
configurations. The general idea is that spaces can be broken down into 
components, analyzed as networks of choices, and then represented as 
maps and graphs that describe the relative connectivity and integration 
of those spaces. From these components it is thought to be possible to 
quantify and describe how easily navigable any space is, useful for the de- 
sign of museums, airports, hospitals, and other settings where way finding 
is a significant issue. Space syntax has also been applied to predict the 
correlation between spatial layouts and social effects such as crime, traffic 
flow, sales per unit area, etc. Analysis of access times between streets 
performed in [22; helps to detect the city modularity. 

The aim of the present paper is twofold. First, we discuss a model 
which accounts for the creation and development of transport networks 
basing on the Cameo principle [1] which refers to the idea of distribution 
of resources, including land, water, minerals, fuel and wealth in general 
(see Sec.[2|). Second, we give an outlook of the use of random walks as an 
effective tool for the investigation of network structures and its functional 
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segmentation (Sec. [3]). In the SecjH we consider two examples of graphs 
(the modelling example of the Petersen graph and the spatial network of 
Venetian canals) and analyze their properties. We conclude in the last 
section. 



2 The Cameo principle and the origin of trans- 
port networks 

Among the classical models in which the degree distribution of the aris- 
ing graph satisfies a power-law is the graph generating algorithms based 
on the preferential attachment approach firstly proposed by H. Simon [24j . 
Within preferential attachment algorithms, the growth of a network starts 
with an initial graph of rig > 2 nodes such that the degree of each node 
in the initial network is at least 1. The celebrated Barabasi- Albert model 
|25| have been proposed in order to model the emergency and growth of 
scale-free complex networks. New nodes in the model [35] are added to 
the network one at a time. Each new node is connected to n of the exist- 
ing with a probability that is biased being proportional to the number of 
links that the existing node already has, 

deg(i) , , 

Ej=ideg(j) 

It is clear that the nodes of high degrees tend to quickly accumulate 
even more links representing a strong preference choice for the emerging 
nodes, while nodes with only a few links are unlikely to be chosen as the 
destination for a new link. The preferential attachment forms a positive 
feedback loop in which an initial random degree variation is magnified 
with time, [55]. It is fascinating that the expected degree distribution 
in the graph generated in accordance to the algorithm proposed in |25| 
asymptotically approaches the cubic hyperbola, 

Vi[ie^\deg{i) = k]~ ^. (2) 

It is however obvious that the mechanisms governing the creation and 
development of transport networks certainly do not follow such a simple 
preferential attachment principle as that discussed in |25j . Indeed, nowa- 
days the new transportation routes are usually created as a result of the 
subdivision or redevelopment of an existing transport network. Appear- 
ing due to the complicated trade-off processes between multiple objectives, 
they can hardly be planed in such a way as to meet the transportation 
routes that already have the ever maximal number of junctions with other 
routes in the network. 
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The challenge of complex network modelling calls for the more realistic 
heuristic principles that could catch the main features of network creation 
and development. It is clear that a prominent model should take into 
account the structure of embedding physical space: the size and shape of 
landscape, and the local land use patterns if a city transportation network 
is considered. A suitable algorithm describing the development of complex 
networks which takes into account the properties of the surrounding place 
has been recently proposed in [T] . It is called the Cameo-principle having 
in mind the attractiveness, rareness and beauty of the small medallion 
with a profiled head in relief called Cameo. It is exactly their rareness 
and beauty which gives them their high value. 

In the Cameo model [1] , the local attractiveness of a site determining 
the creation of new spaces of motion in that is specified by a real positive 
parameter uj > 0. Indeed, it is rather difficult if ever be possible to 
estimate exactly the actual value coii) for any site i g C5 in the urban 
pattern, since such an estimation can be referred to both the local believes 
of city inhabitants and may be to the cultural context of the site that 
may vary over the different nations, historical epochs, and even over the 
certain groups of population. 

Therefore, in the framework of the probabilistic approach, it seems 
natural to consider the value a; as a real positive independent random 
variable distributed over the vertex set of the graph representation of the 
site uniformly in accordance to a smooth monotone decreasing probability 
density function / (uj). Let us suggest that there is just a few distinguished 
sites which are much more attractive then an average one in the city, so 
that the density function / has a right tail for large uj ^ H; such that 

/M « /M- 

Each newly created space of motion i (represented by a node in the 
dual city graph ^{N) containing N nodes) may be arranged in such a 
way to connect to the already existed space j G 25 (TV) depending only on 
its attractiveness Lo{j) and is of the form 

with some a G (0, 1). The assumption ([3]) implies that the probability to 
create the new space adjacent to a space j scales with the rarity of sites 
characterized with the same attractivity ui as j. 

The striking observation under the above assumptions is the emergence 
of a scale- free degree distribution independent of the choice of distribution 
f{u)). Furthermore, the exponent in the asymptotic degree distribution 
becomes independent of the distribution /(w) provided its tail, /(w) <C 
/((D), decays faster then any power law. 

In the model of growing networks proposed in [IJ, the initial graph ©o 
has A^o vertices, and a new vertex of attractiveness u taken independently 
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uniformly distributed in accordance to the given density /(w) is added to 
the ah'eady existed network at each time chck t G Z+. Being associated 
to the graph, the vertex estabhshes fcg > connections with other vertices 
already present in that. All edges are formed accordingly to the Cameo 
principle 

The main result of ^1 is on the probability distribution that a randomly 
chosen vertex i which had joined the Cameo graph © at time r > with 
attractiveness uj{i) amasses precisely k links from other vertices which 
emerge by time t > t. It is important to note that in the Cameo model 
the order in which the edges are created plays a role for the fine structure of 
the graphs. The resulting degree distribution for — r > fc/fco is irrelevant 
to the concrete form of /(w) and reads as following 



Pr 



j: T{j)>T 



''o^" .lni/"f-). (4) 



y^l + l/Q + o(l) 



In order to obtain the asymptotic probability degree distribution for an 
arbitrary node as i — > cx), it is necessary to sum ((4]) over all t < t that 
gives 

Pik)-- J2 fcl + l/a+o(l) 1^'^" [-) ^ ^l+l/a+o(l)- (5) 

0<T<t ^ ^ 

The emergence of the power law ^ demonstrates that graphs with a scale- 
free degree distribution may appear naturally as the result of a simple edge 
formation rule based on choices where the probability to chose a vertex 
with affinity parameter uj is proportional to the frequency of appearance 
of that parameter. If the affinity parameter uj is itself power law like 
distributed one could also use a direct proportionality to the value uj to 
get still a scale free graph. 

We have described a possible mechanism for the emergence and devel- 
opment of complex transport networks based on the Cameo principle. In 
the forthcoming section, we discuss the embedding of transport network 
into the {N — 1)— dimensional Euclidean space which facilitates the dis- 
covering of important nodes, their classification, and the coarse- graining. 



3 Mathematics of transport networks 

Any graph representation naturally arises as the outcome of a cate- 
gorization, when we abstract a real world system by eliminating all but 
one of its features and by grouping together things (or places) sharing a 
common attribute. For instance, the common attribute of all open spaces 
in city space syntax is that we can move through them. All elements 
called nodes that fall into one and the same group V are considered as 
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essentially identical; permutations of them within the group are of no con- 
sequence. The symmetric group Sat consisting of all permutations of N 
elements {N being the cardinality of the set V) constitute therefore the 
symmetry group oiV. If we denote hy E <Z V xV the set of ordered pairs 
of nodes called edges, then a graph is a map G{V, E) : i? ^ if C R_|_ (we 
suppose that the graph has no multiple edges). If two nodes are adjacent, 
€ E we write i ~ j. 



3.1 The right choice for graph representation 

First, we establish a connection between transport flows on the graph 
G and random walks on its dual counterpart G* . 

Given a connected undirected graph G{V,E), in which V is the set 
of nodes and E is the set of edges, we introduce the traffic function / : 
E (0,00 [ through every edge e £ E. It then follows from the Pcrron- 
Frobenius theorem '^Fl] that the linear equation 

/(e) - E /(^') exp(-/i^(e')), (6) 

e' ~ e 

where the sum is taken over all edges e' € E which have a common node 
with e, has a unique positive solution /(e) > 0, for every edge e € E, 
for a fixed positive constant h > and a chosen set of positive metric 
length distances i{e) > 0. This solution is naturally identified with the 
traffic equilibrium state of the transport network defined on G, in which 
the permeability of edges depends upon their lengths. The parameter h 
is called the volume entropy of the graph G, while the volume of G is 
defined as the sum 

Vol(G) = i ^ £ie). 

The volume entropy h is defined to be the exponential growth of the balls 
in a universal covering tree of G with the lifted metric, p5]-[31j. 

The degree of a node i ^ V is the number of its neighbors in G, 
degQ(i) = fcj. It has been shown in [3T] that among all undirected con- 
nected graphs of normalized volume, Vol(G) — 1, which are not cycles 
and for which ki 7^ 1 for all nodes (no cul-de-sacs), the minimal value of 
the volume entropy, mm{h) — i X)ie\/ l^S (^i ~ 1) is attained for the 
length distances 

2 mm(/i) 

where ki and kj are the degrees of the nodes linked by e E E. It is then ob- 
vious that substituting ([7]) and min{h) into © the operator exp {—h£{e')) 
is given by a symmetric Markov transition operator. 



/(e) = E . '^^"'^ (8) 
^ v/(fc.-l) (%-!)' 
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where i and j are the nodes linked by e' G E, and the sum in ([S]) is taken 
over aU edges e' G E which share a node with e. The symmetric operator 
([5]) rather describes time reversible random walks over edges than over 
nodes. In other words, we are invited to consider random walks described 
by the symmetric operator defined on the dual graph G* . 

The Markov process ([5]) represents the conservation of the traffic vol- 
ume through the transport network, while other solutions of ([6]) are re- 
lated to the possible termination of travels along edges. If we denote the 
number of neighbor edges the edge e E E has in the dual graph G* as 
Qe = deg(5*(e), then the simple substitution shows that ^(e) — ^Jql de- 
fines an eigenvector of the symmetric Markov transition operator defined 
over the edges E with eigenvalue 1. This eigenvector is positive and being 
properly normalized determines the relative traffic volume through e € E 
at equilibrium. 

Eq.® shows the essential role Markov's chains defined on edges play 
in equilibrium traffic modelling and emphasizes that the degrees of nodes 
are a key determinant of the transport networks properties. 

The notion of traffic equilibrium had been introduced by J.G. Wardrop 
in |32| and then generalized in [33] to a fundamental concept of network 
equilibrium. Wardrop's traffic equilibrium is strongly tied to the human 
apprehension of space since it is required that all travellers have enough 
knowledge of the transport network they use. The human perception 
of places is not an entirely Euclidean one, but are rather related to the 
perceiving of the vista spaces (viewable spaces of streets and squares) 
as single units and to the understanding of the topological relationships 
between these vista spaces, [34] . 

The use of Eq.([5]) also helps to clarify the inconsistency of the tradi- 
tional axial technique widely implemented in space syntax theory. Lines 
of sight are oversensitive to small deformations of the grid, which leads 
to noticeably different axial graphs for systems that should have similar 
configuration properties. Long straight paths, represented by single lines, 
appear to be overvalued compared to curved or sinuous paths as they are 
broken into a number of axial lines that creates an artificial differentiation 
between straight and curved or sinuous paths that have the same impor- 
tance in the system ^5\. Eq.® shows that the nodes of a dual graph 
representing the open spaces in the spatial network of an urban environ- 
ment should have an individual meaning being an entity characterized by 
the certain traffic volume capacity. 

Decomposition of city space into a complete set of intersecting open 
spaces characterized by the traffic volume capacities produces a spatial 
network which we call the dual graph representation of a city. 
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3.2 The processes associated with permutational au- 
tomorphisms of the graph 

While analyzing a graph, whether it is primary or dual, we assign the 
absolute scores to all nodes based on their properties with respect to a 
transport process defined on that. Indeed, the nodes of G{V, E) can be 
weighted with respect to some measure m = X^iey ™i %i specified by a 
set of positive numbers rui > 0. The space £^{711) of square-summable 
functions with respect to the measure m is a Hilbert space Ti-iV). 

Among all measures which can be defined on V, the set of normalized 
measures (or densities), 

l = $^7ri^i,-, (9) 

iev 

are of essential interest since they express the conservation of a quantity, 
and therefore may be relevant to a physical process. 

The fundamental physical process defined on the graph is generated 
by the subset of its automorphisms preserving the notion of connectivity 
of nodes. An automorphism is a mapping of the object to itself preserving 
all of its structure. The set of all automorphisms of a graph forms a group, 
called the automorphism group. For each graph G{V,E), there exists a 
unique, up to permutations of rows and columns, adjacency matrix A, the 
Nx N matrix defined by Aij = 1 Hi ^ j, and Aij = otherwise. As usual 
A is identified with a linear endomorphism of Co(G), the vector space of 
all functions from V into R. The degree of a node i gV is therefore equal 
to 

Let us consider the set of all linear transformations defined on the adja- 
cency matrix, 

N 

Z{K).. = ^ Tii,iA,u Tijsi e M, (11) 
s,;=i 

generated by the subset of automorphisms of the graph G. 

The graph automorphisms are specified by the symmetric group S^r 
including all admissible permutations p e §jv taking i € V to p{i) G V. 
The representation of Sjv consists of all N x N matrices Tip, such that 
i^pkpii) = 1' and (np)^_^ = if J / 

The function Z {A)-j should satisfy 

Uj Z (A) Up = Z {n]; AUp) , (12) 

for any p G E>n, and therefore entries of the tensor must have the 
following symmetry property, 

•^Pii) P{j) P(s) p{l) — ^ijsh (13) 
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for any p gSn- Since the action of the symmetric group Sat preserves the 
conjugate classes of index partition structures, any appropriate tensor 
satisfying (|13p can be expressed as a Unear combination of the following 
tensors: {1, Sij,5is,Sa,Sjs,Sji,5si,5ijSjs,SjsSsi,SsiSii,SuSij, 
SijSsi,SisSji,diiSjs,SijSiiSis} ■ Given a simple, undirected graph G such 
that An = for any i ^ V then by substituting the above tensors into 
(jlip and taking account on symmetries we conclude that any arbitrary 
linear permutation invariant function must be of the form 

Z (A) = ai+ 6ij (a2 + a^kj) + 04 , (14) 

with kj = degQ(j) and 01,2,3,4 arbitrary constants. 

If we require that the linear function Z preserves the notion of con- 
nectivity, 

fc. -^Z(A),^., (15) 

it is clear that we should take Ci = 02 = (indeed, the contributions 
aiN and 02 are incompatible with (|15p ) and then obtain the relation for 
the remaining constants, 1 — 03 = 04. Introducing the new parameter 
/3 = 04 > 0, we write (|14l) as follows. 



ZiA)^^ ^ {1-P)5,,kj+PA,. (16) 

If we express (|15|) in the form of the probability conservation relation, then 
the function Z (A) acquires a probabilistic interpretation. Substituting 
(fTBl) back into (fT5l). we obtain 



1 _ V 



J2^^^{l-P)6,,+(3^ (17) 

Y- 7^(/3) 



The operator T^p represents the generalized random walk transition op- 
erator if < /3 < fc~ax where fcmax is the maximal node degree in the 
graph G. In the random walks defined by Tj^^ , a random walker stays in 
the initial vertex with probability 1 — /?, while it moves to another node 
randomly chosen among its nearest neighbors with probability (3/ki. If 
we take l3 — I, then the operator T.j^P describes the usual random walks 
discussed extensively in the classical surveys |36j-|38|. 

Being defined on a connected aperiodic graph, the matrix is a real 
positive stochastic matrix, and therefore, in accordance to the Perron- 
Frobenius theorem [57], its maximal eigenvalue is 1, and it is simple. A 
left eigenvector 

7rT('3) = TT (18) 
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associated with the eigenvalue 1 has positive entries satisfying It is 
interpreted as a unique equiHbrium state tt (stationary distribution of the 
random walk). For any density a e Ti-iV), 

TT = lim crT*. (19) 



3.3 Transport network as Euclidean space 

Markov's operators on Hilbert space appear therefore as the natural 
language of complex network theory and space syntax theory in particu- 
lar. Now we demonstrate that random walks embed connected undirected 
graphs into Euclidean space, in which distances and angles acquire the 
clear statistical interpretations. 

The Markov operator T is self-adjoint with respect to the normalized 
measure ([9]) associated to the stationary distribution of random walks tt, 

f = i (ttI/^ T 7r-l/2 + 71-1/2 J.T ^1/2^ ^ (20) 

where is the transposed operator. In the theory of random walks de- 
fined on graphs [36', '38] and in spectral graph theory }39j , basic properties 
of graphs are studied in connection with the eigenvalues and eigenvectors 
of self-adjoint operators defined on them. The orthonormal ordered set 
of real eigenvectors ipi, i = 1 . . . N , of the symmetric operator T defines a 
basis in ?{{¥). 

In particular, the symmetric transition operator T of the random walk 
defined on connected undirected graphs is 



T,^ = I yfc.fc. . . (21) 

[0, I -J. 

Its first eigenvector tpi belonging to the largest eigenvalue fii — 1, 

^Pif = ^,, V'M = TT,, (22) 

describes the local property of nodes (connectivity) , since the stationary 
distribution of random walks is 

where 2M = X^iev The remaining eigenvectors, {V's}fL2J belonging 
to the eigenvalues l>/i2>---MAr>— 1 describe the (?Zo6a/ connectedness 
of the graph. For example, the eigenvector corresponding to the second 
eigenvalue /i2 is used in spectral bisection of graphs. It is called the Fiedler 
vector if related to the Laplacian matrix of a graph [55] . 
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Markov's symmetric transition operator T defines a projection of any 
density a G 'H{V) on the eigenvector ipi of the stationary distribution vr, 

(jf = i^i+a^f, o-i=o--V'i, (24) 

in which cr^ is the vector belonging to the orthogonal complement of -01. 
In space syntax, we are interested in a comparison between the densities 
with respect to random walks defined on the graph G. Since all compo- 
nents ■01 > 0, it is convenient to rescale the density a by dividing its 
components by the components of -01, 

<?i,= (25) 

Thus, it is clear that any two rescaled densities a^p^TL differ with 
respect to random walks only by their dynamical components, 

{d~p)T' = {a^ - p^) f\ 

for all i > 0. Therefore, we can define the distance || . . . ||t between any 
two densities established by random walks by 

\\cT-p\\l = Y,{a^-p^\f'\a^-p^). (26) 
t > 

or, using the spectral representation of T, 

h'PWl =T.t>,l:t2^^^{^^-~p^\i^s){i^s\5^-p^) 



s=2 r 



where we have used Diracs bra-ket notations especially convenient for 
working with inner products and rank-one operators in Hilbert space. 
If we introduce a new inner product for densities (t, p e ^(^) by 

,^ ^ {a^\i^s){ilJs\pi) I . 

[(^,P)t = 7—- ' (28) 

then ([27| is nothing else but 

Ik-P||T = lkllT + IIPllT-2(fT,p)^, (29) 

where 



^ 1 - Ai.s 



s=2 ^ 



is the square of the norm of cr € ^(^) with respect to random walks 
defined on the graph G. 
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We finish the description of the {N — l)-dimensional Euclidean space 
structure of G induced by random walks by mentioning that given two 
densities a, p £ Ti-iV), the angle between them can be introduced in the 
standard way, 

cosZ(p,a) = ^fl^l^. (31) 

Random walks embed connected undirected graphs into the Euclidean 
space R^^^. This embedding can be used in order to compare nodes and 
to construct the optimal coarse-graining representations. 

Namely, in accordance to (|30p. the density Si, which equals 1 a,t i G V 
and zero otherwise, acquires the norm || Si \\rp associated to random walks 
defined on G. In the theory of random walks [36j . its square, 



N i2 



It - ^ i: (32) 



s=2 

gets a clear probabilistic interpretation expressing the access time to a 
target node quantifying the expected number of steps required for a ran- 
dom walker to reach the node i £ V starting from an arbitrary node 
chosen randomly among all other nodes with respect to the stationary 
distribution tt. 

The Euclidean distance between any two nodes of the graph G cal- 
culated as the distance ([27]) between the densities Si and Sj induced by 
random walks, 



,33, 

quantifies the commute time in theory of random walks being equal to the 
expected number of steps required for a random walker starting at i £ V 
to visit j £ V and then to return back to i, 36J. 

It is important to mention that the cosine of an angle calculated in 
accordance to (I5T]) has the structure of Pearson's coefficient of linear cor- 
relations that reveals it's natural statistical interpretation. Correlation 
properties of flows of random walkers passing by different paths have 
been remained beyond the scope of previous studies devoted to complex 
networks and random walks on graphs. The notion of angle between any 
two nodes of the graph arises naturally as soon as we become interested 
in the strength and direction of a linear relationship between two random 
variables, the flows of random walks moving through them. If the cosine 
of an angle (|3ip is 1 (zero angles), there is an increasing linear relation- 
ship between the flows of random walks through both nodes. Otherwise, 
if it is close to -1 (tt angle), there is a decreasing linear relationship. The 
correlation is {tt/2 angle) if the variables are linearly independent. It 
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is important to mention that as usual the correlation between nodes does 
not necessary imply a direct causal relationship (an immediate connec- 
tion) between them. 



4 Examples: Petersen graph and the net- 
work of Venetian Canals 

In the present section, we construct the Euclidean embedding of two 
graphs. One graph we study is the Petersen graph of 10 nodes (see Fig.[T|). 
Another example is the spatial network of 96 Venetian canals which serve 
the function of roads in the ancient city that stretches across 122 small 
islands (see Fig. [2]). While identifying a canal over the plurality of water 
routes on the city map of Venice, the canal-named approach has been 
used, in which two different arcs of the city canal network were assigned 
to the same identification number provided they have the same name. 
The Petersen graph is a regular graph, ki = 3, i = 1,...10, so that 



J2i ki = 30, and the stationary distribution of random walks is uniform, 
7r|^°*-' =0.1. The spectrum of the random walk transition operator (I21|) 
contains the Perron eigenvalue 1 which is simple, then the eigenvalue 1/3 
of multiplicity 5, and —2/3 of multiplicity 4. Therefore, there are just 3 
linearly independent eigenvectors, and two eigensubspaces for which the 
orthonormal basis vectors can be calculated, so that the resulting matrix 
of eigenvectors and basis vectors which we use in p2ll33|) always has full 
column dimension. Random walks embed the Petersen graph into a 9- 
dimensional Euclidean space, in which all nodes have equal norm ([32|) . 
\\i\\rp = 3.14642654 that means the expected number of steps a random 




Figure 1 



The Petersen graph. 
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Figure 2: The dual graph representation of the spatial network of 96 Venetian 
canals 

walker starting from a node chosen randomly with probabihty p = 0.1 
reaches any node in the Petersen graph equals 9.9. Indeed, the structure 
of 9-dimensional vector space induced by random walks defined on the 
Petersen graph cannot be represented visually, however if we choose a 
node as a point of reference, we can draw its 2-dimensional projection 
by arranging other nodes at the distances calculated accordingly to ((551) 
and under the angles found from PT|) they are with respect to the chosen 
reference node (see Fig. [3]). 

It is expected that a random walker starting at #1 visits any periphery 
node (#2,3,4,5) and then returns back in 18 random steps, while it is 
required 24 random steps in order to visit any internal node in the deep of 
the graph (#6, 7, 8, 9, 10). It is also obvious that while the linear relation- 
ship between the random walks flows through #1 and those through the 
periphery nodes is positive, it is negative with respect to the flows passing 
through the internal nodes. Due to the symmetry of the Petersen graph, 
the figure displayed on Fig. [3] is essentially the same if we draw it with 
respect to any other periphery node (#2,3,4,5). It is also important to 
note that it turns to be mirror-reflected if we draw it with respect to any 
internal node (#6,7,8,9,10). Therefore, we can conclude that the Pe- 
tersen graph contains two groups of nodes, at the periphery and in deep. 
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Figure 3: The Euclidean space embedding of the Petersen graph drawn with 
respect to the node #1. 

which appears to be "a quarter" higher segregated from each other (18 
random steps vs. 24 random steps). It is clear that the 9— dimensional 
embedding of the Petersen graph into Euclidean space is characterized by 
the highest degree of symmetry. 

The graph representation of the spatial network of Venetian canals 
(Fig [2]) is much more complicated than the Petersen graph. The graph 
is far from being regular, so that the stationary distribution of random 
walks defined on it is not uniform. In [22] . we have discussed that it is 
not evident that the degree distributions in compact urban patterns and 
in Venice, in particular, follow a power law. The spectrum of the Markov 
transition operator (PT|) defined on that is presented in Fig.[31 The matrix 
(|2T|) for the canal network in Venice is strongly defective. In particular, it 
contains the eigenvalue fi — of multiplicity 22. This degeneracy indicates 
the presence of the complete bipartite subgraph in the spatial network of 
Venice shown in Fig. [2l The norms of canals with respect to random walks 
are different and scales with their connectivity (see Fig. [S]). The notion 
of spatial segregation acquires a statistical interpretation with respect to 
random walks by means of p2[) . In urban spatial networks encoded by 
their dual graphs, the access times strongly vary from one open space 
to another and could be very large for statistically segregated spaces. 
Three data points characterized by the shortest access times shown in 
Fig. [5] are due to the Lagoon of Venice, the Giudecca canal, and the 
Grand canal - the most central water routes in the city canal network. 
Four data points characterized by the worst accessibility represent the 
canal subnetwork of Venetian Ghetto. The slope of the regression line 
equals 2.07. 

The 2-dimensional projection of the Euclidean space of 96 Venetian 
canals set up by random walks drawn for the the Grand Canal of Venice 
(the point (0, 0)) is shown in Fig. [Sj Nodes of the dual graph representa- 
tion of the canal network in Venice are shown by disks with radiuses taken 
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Figure 4: The spectrum of the Markov transition operator (|2ip defined on the 
spatial network of Venetian canals. 

equal to the degrees of the nodes. All distances between the chosen origin 
and other nodes of the graph (Fig. ^ have been calculated in accordance 
to ((33)) and |3T|) has been used in order to compute angles between nodes. 
Canals negatively correlated with the Grand Canal of Venice are set under 
negative angles (below the horizontal), and under positive angles (above 
the horizontal) if otherwise. 

It is evident from Fig. [5] that disks of smaller radiuses demonstrate a 
clear tendency to be located far away from the origin being characterized 
by the excessively long commute times with the reference point (the Grand 
canal of Venice), while the large disks which stand in Fig. [6] for the main 
water routes are settled in the closest proximity to the origin that intends 
an immediate access to them. 

5 Discussion and Conclusion 

In the present paper, we have developed a self-consistent approach to 
modelling of complex networks. We have discussed the possible creation 
algorithm (see Sec. [5]) which refers to the idea of distribution of resources, 
including land, water, minerals, fuel and wealth in general rather than 
to the popularity driven preferential attachment approach proposed in 
[251 [55] . We have also demonstrated that random walks are the effective 
tool for investigation of the graph structure since they embed a connected 
graph into Euclidean space, in which the distances and angles acquire the 
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Figure 5: The scatter plot of the connectivity vs. the norm a node in the dual 
graph representation of 96 Venetian canals acquires with respect to random 
walks. Three data points characterized by the shortest access times represent 
the main water routes of Venice: the Lagoon of Venice, the Giudecca canal, and 
the Grand canal. Four data points of the worst accessibility are for the canal 
subnetwork of Venetian Ghetto. The slope of the regression line equals 2.07. 



statistical interpretations. 

Probably, the most important conclusion of space syntax theory is that 
the adequate level of the positive relationship between the connectivity 
of city spaces and their integration property (vs. segregation) called in- 
telligibility encourages peoples way- finding abilities [53] . Intelligibility of 
Venetian canal network reveals itself quantitatively in the scaling of the 
norms of nodes with connectivity shown in Fig. [5] and qualitatively in the 
tendency of smaller disks to be located on the outskirts of the Venetian 
space syntax displayed in Fig[Sl 
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Figure 6: The 2-dimensional projection of tlie 95-dimensional Euclidean spaces 
associated to random walks defined on the city canal network built from the 
perspective of the Grand canal of Venice chosen as the origin. The labels of the 
horizontal axes display the expected number of random walk steps. The labels 
of the vertical axes show the degree of nodes (radiuses of the disks). 
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